Published online 4 February 2014 



Nucleic Acids Research, 2014, Vol 42, No. 7 4363-4374 

doi:10.1093/nar/gkul07 



Dissecting the function of the adult p-globin 
downstream promoter region using an artificial 
zinc finger DNA-binding domain 

Joeva J. Barrow, Ying Li, Mir Hossain, Suming Huang and Jorg Bungert* 

Department of Biochemistry and IVIolecular Biology, Center for Epigenetics, Genetics Institute, Shands Cancer 
Center, Powell-Gene Therapy Center, University of Florida, Gainesville, 32610, FL, USA 

Received June 21, 2013; Revised January 9, 2014; Accepted January 11, 2014 



ABSTRACT 

Developmental stage-specific expression of the 
P-type globin genes is regulated by many cis- and 
frans-acting components. The adult p-globin gene 
contains an E-box located 60 bp downstream of 
the transcription start site that has been shown to 
bind transcription factor upstream stimulatory 
factor (USF) and to contribute to efficient in vitro 
transcription. We expressed an artificial zinc finger 
DNA-binding domain (ZF-DBD) targeting this site 
(+60 ZF-DBD) in murine erythroleukemia cells. 
Expression of the +60 ZF-DBD reduced the recruit- 
ment and elongation of RNA polymerase II (Pol II) at 
the adult p-globin gene and at the same time 
increased the binding of Pol II at locus control 
region (LCR) element HS2, suggesting that Pol II is 
transferred from the LCR to the globin gene pro- 
moters. Expression of the +60 ZF-DBD also 
reduced the frequency of interactions between the 
LCR and the adult p-globin promoter. ChlP- 
exonuclease-sequencing revealed that the +60ZF- 
DBD was targeted to the adult p-globin downstream 
promoter and that the binding of the ZF-DBD 
caused alterations in the association of USF2 con- 
taining protein complexes. The data demonstrate 
that targeting a ZF-DBD to the adult p-globin down- 
stream promoter region interferes with the LCR- 
mediated recruitment and activity of Pol II. 



INTRODUCTION 

Transcription of the adult (3-type globin gene is tightly 
regulated by distal and proximal c/^-regulatory DNA 
elements (1,2). Although proximal promoter elements 
govern developmental-stage and tissue-specific expression 
of the P-type globin genes, a locus control region (LCR), 
located far upstream of the genes, mediates high-level 



expression in differentiating erythroid cells (3-5). The 
LCR is composed of multiple DNase I hypersensitive 
sites (HS) that each interact with a combination of 
ubiquitously expressed or more cell type-restricted 
transcription factors (6). Among the proteins that bind 
to the LCR are the hematopoietic transcription factors 
GATA-1 and -2, NF-E2 and related proteins, KLFl and 
related proteins. Tall and the ubiquitously expressed 
proteins USF and Spl (6,7). The mechanisms by which 
the LCR mediates high-level transcription of the 
(3-globin genes are not completely understood. It is 
known that during differentiation of erythroid cells the 
genes come in proximity to the LCR (8,9). Furthermore, 
active chromatin marks and transcription complexes first 
associate with the LCR during differentiation of erythroid 
cells (10-13). This suggests that the LCR serves as the 
primary site of transcription complex recruitment and 
possibly assembly, and that the assembled transcription 
complexes are transferred to the globin genes (14). This 
model is supported by in vitro transfer experiments, 
however; in vivo evidence is lacking (15). 

The stage-specific expression of the adult P-globin genes 
is mediated by transcription factors that bind to proximal 
regulatory elements (16). KLFl binds to a CACCC site in 
the upstream promoter and activates P-globin gene 
expression by recruiting chromatin modifying 
activities and components of the basal transcription appar- 
atus (17-19). GATA factors have also been shown to 
recruit chromatin modifying coregulators and activities 
that estabUsh proximity between the LCR and the adult 
P-globin gene promoter (20,21). In contrast to these well- 
estabhshed transcription factors, the role of the E-box 
binding proteins Tall and USF are less well understood. 
Tall/SCL is a hematopoietic-specific bHLH protein that 
interacts with ubiquitous bHLH proteins such as E2A 
and HEB and binds E-box elements (22). In erythroid 
cells. Tall is part of a large transcription factor complex 
that includes GATA-1, its cofactor Fogl and the cofactors 
Ldbl and LM02 (23). There is evidence showing that Ldbl 
is important for establishing proximity between the LCR 
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and the adult P-globin gene promoter (24,25). Tall can be 
cross-linked to the adult P-globin promoter in erythroid 
cells but the precise binding site or whether it is recruited 
indirectly by other factors such as GATA-1 is not known 
(26,27). USF is a dimeric protein composed of either USFl 
or USF2 and both homo- and heterodimers exist and bind 
to E-box elements (28). USF interacts with a conserved 
E-box element located 60 bp downstream of the adult 
P-globin transcription start site in vitro and can also be 
cross-Hnked to the promoter in erythroid cells (29,30). 
Mutations of the +60 E-box reduced transcription of the 
human P-globin gene in vitro (29). An additional conserved 
E-box is located immediately downstream of the 
transcription start site (29). This E-box overlaps with the 
initiator and also interacts with USF in vitro. Expression of 
a dominant negative mutant USF (AUSF), which dimer- 
izes with USF proteins but does not bind DNA, reduces 
recruitment of RNA polymerase II (Pol II) at the P-globin 
gene locus (30). In addition, the expression of AUSF also 
resulted in impaired differentiation of erythroid cells in 
transgenic mice demonstrating a critical role of USF in 
adult P-globin gene regulation (31,32). 

C/^-element redundancy often presents a challenge in 
molecular biology especially when attempting to inter- 
rogate the functional potential of a c/^-element in vivo. 
Removal of c/^-regulatory elements by homologous re- 
combination is feasible but laborious and changes the 
sequence context (33,34). Artificial DNA-binding 
proteins comprising zinc finger domains can surmount 
this challenge (35,36). Zinc finger DNA-binding domains 
(ZF-DBD) bind to DNA in a sequence-specific manner 
with high affinity and neutralize a target site (36). A 
single zinc finger domain can recognize three base pairs 
of DNA — an interaction that can be extended by adding 
additional zinc finger domains in tandem to allow the rec- 
ognition of long asymmetric DNA sequences (37,38). The 
benefit of using ZF-DBDs includes rapid delivery and, 
more importantly, high resolution for c/^-element detec- 
tion. The high resolution allows targeting of sequences 
longer than 18 bp, long enough to bind ZF-DBDs with 
high affinity and specificity and to confer a unique signa- 
ture within the genome (38). However, unspecific binding 
of ZF-DBDs to low-affinity DNA binding sites is a 
concern that may be addressed to some degree using 
ZF-DBDs without effector domains. 

In this study, we designed and functionally 
characterized a ZF-DBD targeting and neutralizing the 
+60 E-box of the murine Pmaj-globin gene. Expression 
of the +60 ZF-DBD led to a reduction in Pmaj-globin 
gene expression in a murine erythroleukemia (MEL) cell 
line. We also observed a reduction in RNA Pol II recruit- 
ment at the Pmaj-globin gene relative to vehicle control 
cells. The association of Pol II and transcription factor 
NF-E2, which is a tissue-restricted transcription factor 
previously implicated in Pol II recruitment at the 
P-globin gene locus (7), with LCR element HS2 was 
increased in cells expressing the +60 ZF-DBD. 
Furthermore, expression of the +60 ZF-DBD impaired 
conformational changes in the P-globin gene locus that 
estabUsh proximity between the LCR and the adult 
Pmaj-globin gene promoter. Expression of a negative 



control (NC) ZF-DBD targeting a putative neutral site 
located between the Pmin- and the Pmaj-globin genes 
had no effect on Pmaj-globin gene transcription. 

MATERIALS AND METHODS 

Zinc finger design and construction 

The +60 ZF-DBD and the NC ZF-DBD were designed and 
polymerase chain reaction (PCR) assembled as described 
previously (36,39). The ZF-DBDs harbored six zinc 
fingers that were designed using the 'zinc finger tools' 
Web site (Mandell and Barbas. Zinc finger (ZF) tools, 
version 3.0. www.scripps.edu/mb/barbas/zfdesignhome. 
php). The oligonucleotides used for assembly and amplifi- 
cation are listed in Supplementary Tables SI and S2. Zinc 
finger domains 1-3 and 4-6 were assembled and amplified 
separately and ligated together with a linker region to con- 
struct the ZF-DBD coding fragment. We used the same 
assembly strategy PCR conditions as described by 
Barrow et al. (36). Sequences encoding for nuclear localiza- 
tion and FLAG tag were incorporated during the final 
amplification steps as outlined by Barrow et al. (36). The 
coding sequences for the ZF-DBDs were ligated into the 
Hpal site of the retrovirus vector pMSCV-neo (Clontech). 
The coding sequences for the +60 ZF-DBD and the NC 
ZF-DBD were sequenced before large-scale preparation of 
the plasmids. 

Cell culture and transfections/transductions 

MEL and Phoenix A cells were grown in 
Dulbecco's modified Eagle's medium (Cellgro) containing 
10% (vol/vol) fetal bovine serum (FBS) and 1% (vol/vol) 
penicillin/streptomycin (Cellgro). The cells were cultured 
at 37°C in the presence of 5% CO2 at a density of 2 x 10^ 
cells/ml. MEL cells were induced to differentiate in the 
presence of 2% (vol/vol) dimethyl sulfoxide (DMSO) for 
72 h. For transfections, the pMSCV-neo vector or the 
pMSCV-neo vector containing the coding sequences for 
the +60 E-box or the NC ZF-DBD was transfected into 
the Phoenix A packaging cell line using Lipofectamine 
2000 (Invitrogen) according to a protocol provided by 
the manufacturer. After 48 h incubation, cells were 
centrifuged and the supernatant containing replication-de- 
fective virus was treated with 2|ig/ml polybrene (Sigma) 
and added to MEL cells. After 48 h incubation, the 
transduced MEL cells were selected by incubation in the 
presence of 100 |ig/ml geneticin (Cellgro). Single cell clones 
were selected by dilution of the pool of stably transduced 
MEL cells and seeding of single cells into 96-well culture 
dish. Colonies were then transferred to larger tissue 
culture dishes and expanded. 

Immunoblot analysis 

Immunoblot analysis was performed as described in 
Barrow et al. (36). Nuclear and cytoplasmic proteins 
were isolated using the NE-PER kit (Thermo Scientific) 
according to the protocol provided by the manufacturer 
and pubHshed procedures (40). For immunoblot analysis, 
10-20 |ig of protein extracts from MEL cells or from 
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cytoplasmic or nuclear fractions was loaded onto 4-15% 
(wt/vol) TGX Tris-HCl gels (Bio-Rad), separated by 
sodium dodecyl sulfate-polyacrylamide gel electropho- 
resis, transferred to polyvenylidenefluoride (PVDF) mem- 
branes and incubated with rabbit anti-ZF sera (a gift from 
Carlos Barbas, Scripps Research Institute, La Jolla, CA, 
USA), mouse anti-BRGl (H-88, Santa Cruz, sc-10768), 
rabbit anti-USF2 (N-18, Santa Cruz, sc-861) or mouse 
anti-Tubulin (G-8, Santa Cruz, sc-55529). After several 
washes, the membranes were incubated with secondary 
anti-mouse or anti-rabbit antisera (Santa Cruz), and 
proteins were detected using enhanced chemiluminescence 
(ECL) reagent (Millipore) and visualized using X-ray films 
(Kodak). 

RNA isolation and quantitative reverse 
transcription-PCR analysis 

RNA isolated from MEL cells using the RNeasy kit 
(Qiagen) was reverse transcribed using the iScript com- 
plementary DNA synthesis kit (Bio-Rad) and subjected 
to quantitative PCR analysis as described previously (36) 
using the following primers: mouse (3-major, upstream 
(US) 5^-CACATTTGCTTCTGACATA-3^ downstream 
(DS) 5^-GCAGAGGCAGAGGATAGGTC-3^; mouse 
dematin, US 5^-ACCGCATGAGGCTTGAGAGG-3^ 
DS 5^-TCTTCTTAAGTTCGTTCCGCTTCC-3^; mouse 
P-spectrin, US 5^-GCTTAAGGAACGCCAGACTCCA 
G-3^ DS 5^-ATTTCTCCTGCTCGTCTTTGT-3^; mouse 
P-actin US 5^-GTGGGCCGCTCTAGGCACCA-3^ DS 
5^-TGGCCTTAGGGTGCAGGGGG-3^; mouse GATAl 
US 5^-CACTCCCCAGTCTTTCAGGTGTA-3^ DS 5^-G 
GTGAGCCCCCAGGAATT-3^; mouse Runxl US 5^-CG 
GTAGAGGCAAGAGCTTCA-3^ DS 5^-GATGTCTTC 
GGGGTTCTCGG-3^; Mouse DOK2 US 5^-GAAGCTG 
CGATGGTCAGGAT-3^ DS 5^-GCCACTTCTTGCCA 
AAGGTC-3^; mouse Ldbl US 5^-TGTGCCTGTCCTG 
GTTGTTC-3^ DS 5^-GTGGGTACATGGGAGTTG 
GG-3^; mouse s/y US 5^-GGCCTGTGGAGTAAGGTC 
AA-3^ DS 5^-GCAGAGGACAAGTTCCCAAA-3^; 
mouse glyceraldehyde 3-phosphate dehydrogenase 
(GAPDH) US 5^-CCAAGGTCATCCATGACAACT-3^ 
DS 5^-ATCACGCCACAGCTTTCC-3^ All reverse tran- 
scriptase-polymerase chain reaction data were normalized 
to mouse P-actin or mouse GAPDH levels. 

Chromatin immunoprecipitation and ChlP-exonuclease 
sequencing 

Chromatin immunoprecipitation (ChIP) assays were 
performed as described previously (36). Briefly, 2 x 10^ 
MEL cells were incubated in 2% formaldehyde for 
lOmin. After quenching the reaction with 0.125 M 
glycine for 5min, cells were lysed and cross-linked 
chromatin was subjected to sonication using conditions 
yielding 200-500 bp fragments. The lysates were then 
incubated with mouse IgG (Santa Cruz; sc-2025) at 4°C 
for 2 h and precleared with protein A sepharose beads (GE 
healthcare; CL-4B). Precleared lysates were centrifuged at 
1700^ for lOmin at 4°C. The supernatant was incubated 
overnight at 4°C on a rotating wheel with the following 
specific antibodies: 10|ig of RNA-Pol II [C- terminal 



domain (CTD) 4H8; Millipore], RNA-Pol II PS2 
(Abeam; Ab 5095), Tal 1 (C-21, Santa Cruz 
Biotechnology, sc-12984X), USF2 (N-18, Santa 
Cruz Biotechnology, sc-861X), Brg-1 (H-88, Santa Cruz 
Biotechnology, sc-10768X) or 10|il of ZF antibody 
(a gift from Dr Carlos Barbas, Scripps Research Institute, 
CaHfornia). After several washes with high and low salt 
buffers, the samples were subjected to phenol/chloroform/ 
isoamylalcohol and chloroform extractions, and the DNA 
was precipitated by adding 2.5 x volume of 100% ethanol 
(vol/vol) and 10|ig glycogen (Invitrogen) overnight 
at — 20°C. The DNA precipitates were washed with 
70% ethanol (vol/vol), resuspended in 10 mM Tris-Cl, 
pH 8.5, and subjected to quantitative (q) PCR as 
described previously (36) using the following primers: 
mouse (3maj-globin, US 5^-AAGCCTGATTCCGTAGA 
GCCACAC-3^ DS 5^-CCCACAGGCCAGAGACAG 
CAGC-3^; mouse HS2 US 5^-TGCAGTACCACTGTCC 
AAGG-3^ DS 5^-ATCTGGCCACACACCCTAAG-3^; 
NC US 5^-CTAGAGACCCATGATTGA-3^ DS 5^-TCA 
ATCATGGGTCTCTAG-3^; mouse Pmaj 3^region, US 
5^-GCTCTTGCCTGTGAACAATG-3^ DS 5^-TGCTTT 
TTATTTGTCAGAAGACAG-3^ 

For ChlP-exonuclease sequencing (ChlP-Exo-Seq), we 
followed a protocol provided by Peconics and published 
by Rhee and Pugh (41). The MEL cell clone 15 expressing 
higher levels of the +60 ZF-DBD and the control cells 
harboring the vector only were subjected to cross-linking 
and sonication as described under the ChIP protocol. 
Sonication yielded fragments between 100 and 500 bp. 
For each cell clone 1.8 x 10^ million cells were processed 
and divided into nine aliquots representing ~2xl0^ 
milHon cells each. Three ahquots per antibody were sent 
to Peconics for ChlP-Exo-Seq. The immunoprecipitation 
step was performed using 5 |ig of antibody per 10^ cells. 
The immunoprecipitated chromatin was then subjected to 
exonuclease treatment and high-throughput sequencing; 
these procedures were performed by Peconics. Bam-files 
and Bed-graph files provided by Peconics were analyzed 
using the IGV2.3 genome browser [http://broadinstitute. 
org, (42,43)]. 

Chromatin conformation capture 

The chromatin conformation capture (3C) assay was 
performed as described previously with minor modifica- 
tions (8,44). Briefly, 4 x 10^ cells were cross-linked with 
2% formaldehyde for lOmin and stopped by the 
addition of glycine at a final concentration of 0.125 M. 
Cells were pelleted, washed twice with cold phosphate- 
buffered saline and lysed in lysis buffer (10 mM Tris, 
pH8.0, 10 mM NaCl, 0.2% NP40 and protease inhibitors) 
at 4°C for 90min while rotating. The nuclei were collected 
by centrifugation, washed with appropriate 1 x restriction 
enzyme buffer (Buffer 3, New England Biolabs. NEB) and 
then resuspended in restriction buffer 3 (NEB) containing 
0.3% sodium dodecyl sulfate (SDS) at 37°C for 1 h while 
shaking. Triton-X-100 was added at a final concentration 
of 1.8% to sequester SDS, and incubation proceeded for 
another 1 h at 37°C. The chromatin was incubated with 
300 U Bgl II (NEB) at 37°C overnight while shaking, and 
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the reaction was stopped by adding SDS to a final 
concentration of 1.6% and incubating at 65°C for 
30min. The samples were subsequently diluted with 
ligation reaction buffer to a final volume of 1 ml. The 
ligation reaction buffer contained 1% Triton X-100, 
50 mM Tris, pH 7.5, 10 mM MgCl2, 10 mM dithriotheitol 
(DTT), O.lmg/ml bovine serum albumin and 1 mM 
adenosine triphosphate. T4 DNA ligase (600 U, NEB) 
was added and the reaction was incubated at 16°C for 
5 h followed by Hgation for 30 min at room temperature. 
After reversal of the cross-link, the DNA was isolated, 
ampHfied and cloned into pCR4-TOPO vector 
(Invitrogen) for sequencing. To control for PCR amplifi- 
cation efficiency of different primer pairs, equimolar 
amounts of plasmids containing the PCR products were 
mixed together to generate the control templates. PCR 
reactions were performed with control and experimental 
templates amplified in parallel. To control for potential 
differences in template quality, the interactions at the (3- 
globin locus from the experimental templates were 
normalized to interaction frequencies with the GAPDH 
locus, which were low and quantified with ImageQuant. 
The calculation of the relative cross-linking frequency 
between two loci was performed as described by Tolhuis 
et al. (8). 



RESULTS 

ZF-DBDs derived from the Barbas modules were designed 
to neutralize the +60 E-box or to bind a NC region 
located downstream of the (3maj-globin gene. The 
designed ZF-DBDs are composed of six zinc finger 
domains to target 18-bp regions harboring the target 



sites (Figure lA). The coding region for the ZF-DBD 
together with FLAG and nuclear localization signal tags 
were Hgated into the retroviral pMSCV vector. MEL cells 
were then transduced to generate stable populations for 
the NC ZF-DBD through geneticin sulfate selection. 
Single cell clones stably expressing either lower levels or 
higher levels of the +60 E-box ZF-DBD protein were 
selected to evaluate the effect of different concentrations 
of the ZF-DBD on transcription factor binding and 
Pmaj-globin gene expression. The NC ZF-DBD was 
designed to interact with a putative neutral DNA 
sequence located in between the Pmaj- and Pmin-globin 
genes. The NC ZF-DBD served as a NC and was not 
expected to change expression of the adult Pmaj-globin 
gene. The +60 ZF-DBD was designed to interact with 
an E-box located 60 bp downstream of the Pmaj-globin 
transcription start site. We expected the +60 ZF-DBD to 
prevent the binding of transcription factor USF to this site 
and to reduce Pmaj-globin gene transcription. 

To confirm nuclear localization, MEL cells harboring 
the NC- and the +60 ZF-DBDs were fractionated into 
cytoplasmic and nuclear fractions (Figure IB and C). 
The nuclear protein BRGl and the cytoplasmic protein 
P-tubuHn were used as controls to indicate complete 
fractionation. The data demonstrate that the ZF-DBDs 
localized to the nucleus and that there was no expression 
of ZF-DBDs in cells harboring the empty vector. Two 
single cell clones expressing the +60 ZF-DBD at different 
levels were isolated and expanded. The data show that 
clone 19 expressed lower levels of the +60 ZF-DBD 
compared with clone 15 (Figure IB). 

To determine the effect of the ZF-DBDs on adult Pmaj- 
globin expression, RNA was extracted from uninduced 
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Figure 1. Synthetic ZF-DBDs localize to the nucleus in MEL cells. (A) Target sites for the NC and +60 ZF-DBDs are shown relative to the 
transcription start site of the Pmaj-globin gene (not drawn to scale). (B) Compartmentalization immunoblot where cells were fractionated into 
cytosolic (C) and nuclear (N) fractions. Brgl and tubuhn (top and middle panel, respectively) represent controls. Two single cell clones reflecting low 
and medium protein expressing levels (clones 19 and 15, respectively) both localize the ZF-DBD to the nuclear fraction in MEL cells. 
(C) Compartmentalization immunoblot showing that the NC ZF-DBD protein in MEL cell populations C4 and D2 localized to the nucleus 
(bottom panel). 
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and DMSO-induced MEL cells and reverse transcribed. 
Quantitative PCR analysis in stable MEL cells expressing 
the NC ZF-DBD revealed that this ZF-DBD did not 
affect (3maj-globin gene expression in either uninduced 
or DMSO-induced MEL cells (Figure 2A). In contrast, 
Pmaj-globin gene expression significantly decreased in 
cells expressing the +60 ZF-DBD compared with vehicle 
control cells (Figure 2B). Furthermore, the decrease in 
Pmaj-globin gene expression occurred in a dose-dependent 
manner relative to the concentration of the +60 ZF-DBD 
present (Figure 2B); clone 15, which expressed higher 
levels of the +60 ZF-DBD (Figure IB) revealed lower 
levels of Pmaj-globin gene expression compared with 
clone 19, which expressed lower levels of the protein. To 
analyze the specificity of the +60 ZF-DBD, additional 
E-box containing erythroid-specific genes such as 
dematin and P-spectrin were evaluated. The data show 
that expression of the +60 ZF-DBD did not affect 



expression of the control genes (Figure 2B). The data dem- 
onstrate that the +60 ZF-DBD specifically reduced ex- 
pression of the adult Pmaj-globin gene without affecting 
expression of other erythroid-specific genes. 

Next to confirm the specificity of the ZF-DBDs, we 
examined target occupancy by ChIP in induced MEL 
cells. The +60 ZF-DBD associated with the Pmaj-globin 
promoter, with a higher level of occupancy in clone 15 
compared with clone 19 (Figure 2C). The association of 
the +60 ZF-DBD with the NC-region or LCR HS2 was 
lower. The NC ZF-DBD associated with the NC-region in 
the pool of transduced MEL cells (Figure 2D). We 
detected a low level of occupancy at LCR HS2 that was 
not statistically significant. 

To analyze the consequence of neutralizing the +60 
E-box on the binding of transcription factors involved in 
Pmaj-globin gene expression, we performed a series of 
Chip experiments. First, we examined candidate 
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Figure 2. Expression of the +60 ZF-DBD reduces Pmaj-globin gene expression. (A) The qPCR analysis of Pmaj-globin gene transcription in 
uninduced (UN) or DMSO-induced (IN) MEL cells expressing the NC ZF-DBD (NCZF) or harboring the vector control (Vector). (B) The 
qPCR analysis of Pmaj-globin, dematin and spectrin gene transcription in uninduced (UN) or DMSO-induced (IN) MEL cells harboring the 
vector (Vector) or expressing the +60 ZF-DBD (+60) in the two different clonal cell hues (clone 15 and 19). The data presented in (A) and (B) 
resulted from two independent RNA extractions with the qPCR performed in tripHcate +SEM. Statistical analysis was based on the Student's /-test 
(*/'<0.05). (C) Chip analysis of the occupancy of the +60 ZF-DBD to selected sites in the murine P-globin gene locus including the NC Region, 
LCR HS2 (HS2) and the Pmaj-promoter (P-maj Pro). Two single ceU clones expressing low (clone 19) or high (clone 15) levels of the +60 ZF-DBD 
or a pool of cells harboring the empty vector (Vector B) was analyzed. (D) ChIP analysis of NC ZF-DBD occupancy at the indicated regions in the 
P-globin locus. Vector B denotes control cells harboring the empty vector. The data in (C) and (D) represent two independent ChIP experiments with 
the PCR performed in triplicate +SEM. Statistical analysis was based on Student's /-test (*P<0.05). 
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transcription factors that are known to bind to E-box 
sequences such as USFl and USF2 as well as Tall. 
8Both USF (Figure 3A) and Tall (Figure 3B) occupied 
both HS2 of the LCR and the (3-major promoter in 
induced vehicle-treated control MEL cells. Surprisingly, 
the associations of USF2 (Figure 3 A) and that of Tall 
(Figure 3B) with the adult (3maj-globin gene promoter 
were not affected in cells expressing the +60 ZF-DBD 
compared with vehicle control cells. 

The binding of the +60 ZF-DBD downstream of the 
transcription start site could interfere with the recruitment 
and/or the elongation of transcription complexes at the 
adult (3maj-globin gene promoter. To distinguish 
between these possibiHties, we examined Pol II occupancy 
at the (3-globin gene locus in induced MEL cells harboring 
the vector or the construct expressing the +60 ZF-DBD 
(Figure 4). We observed a decrease in the recruitment of 
Pol II at the adult (3maj-globin gene promoter in cells 
expressing the +60 ZF-DBD (Figure 4A). The decrease 
in Pol II occupancy was more pronounced at the 3^end 
of the (3maj-globin gene suggesting that expression of the 
+60 ZF-DBD affects both recruitment and elongation of 
Pol II. Interestingly, compared with control cells, cells ex- 
pressing the +60 ZF-DBD revealed enhanced association 
of Pol II at LCR HS2. We next analyzed recruitment of 
Pol II phosphorylated at serine 2 of the CTD (Figure 4B). 
CTD serine 2 phosphorylation occurs during transcription 
elongation. Associations of the phosphorylated form of 
Pol II with LCR HS2 also increased in cells expressing 
the +60 ZF-DBD. We did not observe a decrease in 
association of the serine 2 phosphorylated form of Pol II 
with the (3maj-globin promoter showing that the defect in 
elongation is not due to a failure to phosphorylate the 
serine 2 residue of the Pol II C-terminal domain. The 
data strongly suggest that the binding of the +60 
ZF-DBD at the Pmaj-globin downstream promoter 
element interfered with the recruitment and elongation 
of transcription complexes at the (3-globin gene 
promoter. Expression of the NC ZF-DBD did not affect 
recruitment of Pol II at the Pmaj-globin gene promoter 



(Figure 4C), demonstrating that ZF-DBDs only exert 
effects if targeted to specific sites. 

Previous studies have shown that transcription factor 
NF-E2 associates with LCR HS sites as well as with the 
Pmaj-globin promoter and mediates the transfer of Pol II 
from the LCR to the P-globin gene (15,45,46). We 
examined the association of NF-E2 with LCR HS2 and 
with the adult Pmaj-globin gene promoter in cells 
expressing the +60 ZF-DBD using ChIP (Figure 4D). 
The results show that expression of the +60 ZF-DBD 
increased the interaction of NF-E2 with LCR HS2, 
similar to what we observed for Pol II. The binding of 
NF-E2 at the Pmaj-globin promoter was low, and 
changes in cells expressing the +60 ZF-DBD were not 
statistically significant. 

Next, we examined proximity between the adult 
Pmaj-globin promoter and the LCR using the 3C 
procedure (Figure 5A). It has been shown previously 
that the P-globin locus undergoes a conformation 
change that juxtaposes the Pmaj-globin promoter and 
the LCR in cells in which the gene is expressed (8,9). We 
performed the 3C assay in cells expressing the vehicle only 
and in the clonal +60 ZF-DBD cell line 15. The data in 
Figure 5B show that Bgl II digestion efficiency at the 
various regions of the P-globin locus analyzed in the 3C 
experiment was similar in the two different cell lines 
analyzed. The 3C data shown in Figure 5 A reveal a 
reduced interaction between LCR HS2 and the adult 
Pmaj-promoter in induced MEL cells expressing the +60 
ZF-DBD. Essentially no interactions were observed 
between the LCR and the PHl gene, which is not 
expressed in MEL cells, or between the LCR and a 
region far downstream of the Pmaj-globin gene. 

To address the specificity of the +60 ZF-DBD and to 
examine if binding of the +60 ZF-DBD alters the 
interaction pattern of USF with the Pmaj-globin gene 
promoter, we subjected MEL cells expressing or not 
expressing the ZF-DBD to ChlP-Exo-Seq (Figure 6). 
ChlP-Exo-Seq is related to ChlP-Seq but contains an 
additional step in which accessible DNA is digested with 
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exonuclease leaving chromatin fragments that are bound 
by proteins and thus protected from exonuclease digestion 
(41). We used three different antibodies in these experi- 
ments, a NC IgG antibody, an antibody specific for the 
ZF-DBD backbone and an antibody specific for USF2 
(Figure 6). The data demonstrate that the ZF-DBD 
antibody is highly specific; it did not yield any peaks 
above the IgG background in cells not expressing the 
+60 ZF-DBD. Importantly, cells expressing the +60 
ZF-DBD revealed two prominent peaks associated with 
the adult Pmaj- and (3min-globin gene promoters. The 
Pmin- and Pmaj-globin downstream promoters contain 
identical +60 ZF-DBD target sequences. Strikingly, the 
ChlP-Exo-Seq pattern for the +60-ZF-DBD in the 
Pmaj-globin downstream promoter region demonstrates 
that protection from exonuclease digestion starts 
immediately at the 3^ end of the targeted sequence as 
indicated by the black bar in the lower right of Figure 6. 
This demonstrates that the +60-ZF-DBD was interacting 
specifically with the targeted sequence. The increased 
sequence reads over a broad range of the promoter, and 
upstream regions Hkely reflect binding of other proteins 
that are cross-linked together with the +60 ZF-DBD at the 
adult Pmaj-globin promoter. We also subjected MEL cells 



expressing or not expressing the +60 ZF-DBD to ChlP- 
Exo-Seq using antibodies against USF2. Expression of the 
+60 ZF-DBD changed the USF2 ChlP-Exo-Seq pattern at 
the adult Pmaj-globin gene, suggesting an altered inter- 
action mode of USF2 containing protein complexes with 
the adult globin gene promoter (Figure 6). 

There were additional +60 ZF-DBD peaks associated 
with the P-globin gene locus, one prominent peak 
upstream of the P-minor globin gene and three peaks 
upstream of the embryonic s/y-globin gene, two of 
which corresponding to LCR elements HSl and HS2 
(Figure 6, top). Genome wide the +60 ZF-DBD interacted 
with other sites mostly located in intergenic regions or in 
introns. Most of these interactions are Hkely due to the 
presence of low-affinity binding sites or due to opportun- 
istic residence at highly accessible regions at which lack of 
competition led to relatively high cross-linking 
frequencies. We examined expression of five additional 
genes that were associated with +60 ZF-DBD binding 
peaks in intronic or upstream promoter regions, including 
the embryonic e/y-globin, Runxl, Ldbl, DOK2 and 
GATAl genes (Figure 7A). None of these genes revealed 
changes in expression levels between ZF-DBD expressing 
and non-expressing cells that were comparable with the 
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reduction in expression of the adult (3maj -globin gene 
(Figures 2B and 7A). 



DISCUSSION 

In this study, we examined the functional role of the adult 
(3maj-globin downstream promoter region using an 
artificial ZF-DBD targeting an E-box. Using artificial 
ZF-DBDs has several advantages over recombination- 
based strategies including high resolution and the abihty 
to titrate the ZF-DBD so that either partial or complete 
elimination of the function of a c/^-regulatory DNA 
element can be achieved. However, one concern of 
applying artificial ZF-DBDs for modulating gene 
expression is the fact that they can bind to unspecific 
DNA sequences harboring partial sequence homology to 
the targeted sequence. Therefore, we use ZF-DBDs 
without effector domains to minimize unspecific effects 
(36). We demonstrate in this study that a ZF-DBD 
targeting the +60 E-box interferes with the recruitment 
of Pol II to the (3maj-globin gene promoter and reduces 
transcription of that gene. The data show that the negative 



effects exerted by the +60 ZF-DBD were sensitive to the 
expression levels of ZF-DBD expression. A ZF-DBD 
targeting a NC region far downstream of the Pmaj-globin 
gene had no effect on gene expression demonstrating that 
ZF-DBDs have to be targeted to specific sites to exert a 
function. 

The ZF-DBDs used in this study harbored six ZFs 
specifying 18-bp DNA target sequences. Much Hke 
transcription factors in general, artificial ZF-DBDs 
interact with other sites with lower affinity. To determine 
the extent of ZF-DBD interactions in the MEL cells, we 
subjected +60 ZF-DBD expressing cells to ChlP-Exo-Seq 
(Figure 6). The data show specific interactions of the 
ZF-DBD with the adult P-globin gene promoters. 
However, the +60 ZF-DBD interacts with other sites in 
the MEL genome. Most of these sites are located in 
introns or intergenic regions and do not always reveal 
sequence similarity to the targeted sequence. We believe 
that most of the chromatin interactions of the ZF-DBD in 
MEL cells are due to the presence of low-affinity binding 
sites or caused by prolonged occupancy of highly access- 
ible regions in pursuit of finding a binding site. Lack of 



Nucleic Acids Research, 2014, Vol. 42, No. 7 4371 



I 

Range: 0-50 

Mel+ZF, aIgG 



100 kb 



MEL-ZF, aZF 



MEL+ZF, aZF 



I 

Olfr68 



Range 
0-50 



0-3 



Olfr67 



H 

Hbb-minor 



1.^ lL .-..^.j — . I .... 1- . li jU L . 

I I I II 



HI 



HI H 



HI 



MEL+ZF, aZF 



MEL-ZF, aUSF2 
MEL+ZF, aUSF2 



Hbb-major Hbb-bh1/bh2 Hbb-y HS 1 2 3 4 5 

— ' 1 ' 

LCR 



Range 



Hbb-major 



0-4 


Mel+ZF, dgG 


0-4 


MEL-ZF, aUSF2 


^^^^^^^^^^^^^^^ 


■ MEL+ZF, aUSF2 






^^^IVIELJ^, aZF 




T C T : A G C A T C A '3 T C A G G T G C A C C A T G A T G T C T G T T T C T G G G G T T '3 T 


3AGTCAACACAACTATGTCAGAJ 




18bp ZF-DBD target 



Figure 6. ChlP-Exo-Seq analysis of +60 ZF-DBD and USF2 occupancy in the P-globin gene locus in MEL cells. MEL cells expressing the +60 ZF- 
DBD (MEL+ZF) or MEL cells not expressing a ZF-DBD (MEL— ZF) were subjected to ChlP-Exo-Seq using antibodies against the ZF-DBD 
background (aZF), USF2 (aUSF2) or NC IgG (aIgG). Bed-graph files were analyzed using the Integrated Genomics Viewer (42,43). Shown on top is 
the mouse P-globin gene locus and flanking olfactory receptor (Olfr) genes. On the lower left is a magnified view of the adult Pmaj-globin gene 
showing ChlP-Exo-Seq patterns for the +60 ZF-DBD and USF2. Further magnification showing the DNA sequence of the adult Pmaj-globin 
downstream promoter region is shown on the lower right. The black bar indicates the target sequence of the +60 ZF-DBD. 



competition would lead to efficient cross-linking of the 
ZF-DBDs at these sites. We examined several genes that 
revealed binding peaks for the +60 ZF-DBD in promoter 
or intronic regions, including genes encoding GATAl, 
Runxl, Ldbl and Dok2 (Figure 7A). The +60 ZF-DBD 
did not change expression of any of these genes in 
MEL cells demonstrating that the binding of the 
ZF-DBD at most sites is without functional consequence. 
Nevertheless, the data highlight the importance of using 
ZF-DBDs without effector domains. 

Previous studies have identified several DNA sequence 
elements in the adult (3-globin downstream promoter 
region that contribute to efficient transcription. A 
sequence around +20 has been shown to mediate inter- 
actions with the TFIID complex, whereas the +60 E-box 
interacts with USFl and two heterodimers (29,47). Both 
of these elements contribute to efficient in vitro transcrip- 
tion of the human (3-globin gene. There is another E-box 
located immediately downstream of the transcription start 
site that interacts with USFl in vitro (29). However, this 
site was not important for in vitro transcription of the 
human P-globin gene (29). Importantly, USF has also 
been shown to associate with the murine Pmaj-globin 
gene promoter in the context of erythroid cells (30,44). 
Furthermore, expression of a dominant negative form of 



USF decreased transcription complex recruitment and 
Pmaj-globin gene expression in murine MEL cells and in 
transgenic mice (30,31). The collective data suggest that 
USF interacts with the +60 E-box and mediates the 
recruitment of transcription complexes to the adult 
P-globin gene promoter. In support of this conclusion, 
expression of the +60 ZF-DBD in MEL cells reduced 
recruitment of Pol II and transcription of the Pmaj-globin 
gene. However, we did not observe a decrease in the 
binding of USF at the adult Pmaj-globin gene promoter 
in cells expressing the +60 ZF-DBD. One possible 
explanation for this observation is that Tall, a hem- 
atopoietic-specific E-box binding protein, mediates the 
positive role of the +60 E-box in erythroid cell. 
However, the association of Tall with the adult 
Pmaj-globin gene promoter was also not impaired in 
cells expressing the +60 ZF-DBD (Figure 4B). To 
examine in more detail if binding of the +60 ZF-DBD 
altered the interaction pattern of USF2 with the adult 
Pmaj-globin gene promoter, we subjected the cells to 
ChlP-Exo-Seq (Figure 6). The ChlP-Exo-Seq pattern of 
USF2 at the adult P-globin promoter was different in cells 
expressing the +60 ZF-DBD compared with control MEL 
cells, suggesting an altered binding conformation of USF2 
containing protein complexes. It is possible that the 



4872 Nucleic Acids Research, 2014, Vol. 42, No. 7 



A 1.6 



1.4 



□ MEL_C 

■ MEL_ZF 

■ MEL_C_lnduced 

□ MEL ZF Induced 





LCR 



p-globin | 
Pol II 

Elongating Pol Hi 



I USF2 
I NF-E2 
k+60 ZF-DBD 



GATA1 RUNX1 



D0K2 



Figure 7. Expression analysis of erythroid genes in MEL cells expressing or not expressing the +60 ZF-DBD and model of the impact of the ZF- 
DBD on transcription complex assembly and function at the adult Pmaj-globin gene. (A) The qPCR analysis of s/y-globin, GATAl, Runxl, Ldbl 
and DOK2 gene transcription in uninduced (MEL) or induced (MEL-induced) MEL cells expressing (MEL_ZF) or not expressing (MEL_C) the 
+60-ZFDBD. The data resulted from two independent RNA extractions with the qPCR performed in tripHcate ±SEM. Statistical analysis was based 
on the Student's /-test. None of the changes in messenger RNA levels were statistically significant {P > 0.05). (B) Model for the impact of the +60 
ZF-DBD on transcription complex assembly and function at the adult Pmaj-globin gene. 



reduced binding and activity of Pol II at the adult 
(3maj-globin gene promoter in cells expressing the +60 
ZF-DBD allows USF2 and associated proteins to bind 
to the E-box at the transcription start site (TSS) (29). 

Regardless of how the +60 ZF-DBD interferes with the 
activity of USF, its binding at the Pmaj -downstream 
promoter reduced efficient recruitment and elongation of 
transcription complexes. We and others previously 
hypothesized that transcription complexes are first 
recruited to the LCR, which is an extremely accessible 
region and contains a high density of transcription 
factor binding sites (48,49). Pol II can be cross-Hnked to 
LCR HS elements during erythroid differentiation before 
it is detectable at the P-type globin gene promoters (12,13). 
Pol II transcription complexes may thus be assembled at 
the LCR and then transferred to the globin gene 
promoter. USF and the erythroid transcription factor 
NF-E2 have been impHcated in the transfer of Pol II 
from the LCR to the adult P-globin gene promoter 
(12,15,49). The data presented in this study demonstrate 
that although Pol II recruitment to the Pmaj-globin gene 
promoter is impaired, the binding of Pol II at LCR HS2 is 
increased in cells expressing the +60 ZF-DBD. This 
supports the hypothesis that the LCR is the primary site 
of transcription complex recruitment. Interestingly, we 
also detected an increase in the association of NF-E2 
with LCR HS2 in cells expressing the +60 ZF-DBD. 
The binding of NF-E2 at the promoter was low, and 
possible changes in binding at this site were not statistic- 
ally significant. Nevertheless, the data provide further 
support of a model in which Pol II is transferred from 
the LCR to the adult P-globin gene promoter in a 



process facilitated by USF and NF-E2 (15,49). We had 
previously shown that USF and NF-E2 physically 
interact with each other in erythroid cells (15). In a 
recent study, examining chromatin accessibihty using a 
high resolution DNase-I in vivo footprinting assay, inves- 
tigators provided evidence suggesting that NF-E2 and 
USF interact with each other and that NF-E2 recruits 
USF indirectly to distal elements, whereas USF recruits 
NF-E2 indirectly to sites in the vicinity of transcription 
start sites (50). ChlP-Exo-Seq yielded a low number of 
sequence reads for USF2 selected Pmaj-globin fragments. 
Perhaps, USF only interacts transiently with the Pmaj- 
downstream promoter to recruit the transcription 
complex from the LCR and to stimulate early transcrip- 
tion elongation. NF-E2 could reach out from the LCR to 
interact with USF2 and this could facihtate the transfer of 
Pol II. The binding of the +60 ZFDBD could specifically 
interfere with the precise architecture of USF2/NF- 
E2-mediated interactions and consequently recruitment 
of transcription complexes (Figure 7B). 

Many recent studies have demonstrated that the P-globin 
gene locus changes conformations during differentiation of 
erythroid cells and activation of the globin genes (8,9,21,24). 
How proximity between the LCR and the adult P-globin 
gene is established is not known but requires proteins 
known to interact with GATA-1 or the GATA-l/Tall 
protein complex, Fogl and LdBl (21,24). In addition, the 
histone arginine methyltransferase Prmtl, which is recruited 
to the adult Pmaj-globin gene promoter by USF, has been 
shown to be required for establishing proximity between the 
LCR and the adult globin promoter (44). Finally, the ad- 
enosine triphosphate-dependent chromatin remodeling 
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enzyme Brgl has also been shown to mediate proximity 
between the adult (3-globin gene and the LCR (51). We 
demonstrate that the binding of the +60 ZF-DBD to the 
downstream promoter interfered with the recruitment of Pol 
II and establishment of proximity between the LCR and the 
Pmaj-globin gene promoter. All of the proteins involved in 
mediating proximity between the LCR and the adult (3- 
globin gene promoter also reduce the recruitment of tran- 
scription complexes to the gene (21,24,44,51). Perhaps what 
mediates interactions between the distal element and the 
promoter is the transfer of transcription complexes, an 
event that may be captured by the cross-linking reaction. 
This would be consistent with recent data suggesting that 
the fraction of erythroid cells exhibiting interactions 
between the LCR and the adult (3-globin gene promoter in 
3 C assays is low (52). 

Grosveld et al. recently demonstrated that Pol II 
transcription initiates within transcription factories and 
that the elongating polymerase moves away from these 
loci (53). Furthermore, Groudine et al. showed that the 
LCR is required to anchor the (3-globin gene locus at 
transcription factories and also mediates efficient 
transcription elongation of the (3-globin gene (54). Our 
data suggest that the binding of the +60 ZF-DBD 
interferes with the LCR-mediated transfer of Pol II to 
the (3-globin promoter and with the release of Pol II 
from the promoter. These events, however, are linked 
because if there is a deficiency in elongation, fewer Pol 
II transcription complexes are recruited to the promoter 
and accumulate at the LCR. Thus, our data strongly 
support the hypothesis that (3-globin promoter associated 
Pol II is recruited from the LCR (48,49). 

In summary, the current study demonstrates that 
targeting a ZF-DBD to the adult (3-globin downstream 
promoter interferes with the recruitment and activity of 
Pol II. Thus, ZF-DBDs are useful tools for changing 
gene expression patterns. 
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